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Abstract 

Convex rank tests are partitions of the symmetric group which have desirable geometric 
properties. The statistical tests defined by such partitions involve counting all permutations 
in the equivalence classes. Each class consists of the linear extensions of a partially ordered 
set specified by data. Our methods refine existing rank tests of non-parametric statistics, 
such as the sign test and the runs test, and are useful for exploratory analysis of ordinal 
data. We establish a bijection between convex rank tests and probabilistic conditional in- 
dependence structures known as semigraphoids. The subclass of submodular rank tests is 
derived from faces of the cone of submodular functions, or from Minkowski summands of the 
permutohedron. We enumerate all small instances of such rank tests. Of particular interest 
are graphical tests, which correspond to both graphical models and to graph associahedra. 

Keywords: braid arrangement, graphical model, permutohedron, polyhedral fan, rank 
test, semigraphoid, submodular function, symmetric group. 

1 Introduction 

The non-parametric approach to statistics was introduced by [16] via the method of permuta- 
tion testing. Subsequent development of these ideas revealed a close connection between non- 
parametric tests and rank tests, which are statistical tests suitable for ordinal data. Beginning 
in the 1950s, many rank tests were developed for specific applications, such as the comparison of 
populations or testing hypotheses for determining the location of a population. The geometry 
of these tests was explored in [6]. More recently, the search for patterns in large datasets has 
spurred the development and exploration of new tests. For instance, the emergence of microarray 
data in molecular biology has led to tests for identifying significant patterns in gene expression 
time series; see e.g. |26j . This application motivated us to develop a mathematical theory of 
rank tests. We propose that a rank test is a partition of S n induced by a map r : S n — ► T 
from the symmetric group of all permutations of [n] = {1, . . . ,n} onto a set T of statistics. The 
statistic t(tt) is the signature of the permutation tt £ S n . Each rank test defines a partition of 
S n into classes, where tt and tt' are in the same class if and only if t(tt) = t(tt'). We identify 
T = image (r) with the set of all classes in this partition of S n . Assuming the uniform distribu- 
tion on S n , the probability of seeing a particular signature t € T is 1/n! times |t~ (t)\. The 
computation of a p- value for a given permutation tt € S n leads to the problem of summing 

Pr(vr') = I-lr- 1 ^'))! (1) 
n! 

over permutations tt' with Pr(7r') < Pr(7r), a computational task to be addressed in Section 6. 

This paper is an expanded version of our note "Geometry of Rank Tests" which was presented 
in September 2006 in Prague at the conference Probabilistic Graphical Models (PGM 3). The 
emphasis of our discussion is on the mathematics underlying rank tests, and, in particular, on 
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the connection to statistical learning theory (semigraphoids). We refer to [H] for details on how 
to use our rank tests in practice, and how to interpret the p- values derived from ([I]). 

The five subsequent sections are organized as follows. In Section 2 we explain how existing 
rank tests in non-parametric statistics can be understood from our geometric point of view, and 
how they are described in the language of algebraic combinatorics |19j . In Section 3 we define 
the class of convex rank tests. These tests are most natural from both the statistical and the 
combinatorial point of view. Convex rank tests can be defined as polyhedral fans that coarsen 
the hyperplane arrangement of S n . Our main result (Theorem [6]) states that convex rank tests 
are in bijection with conditional independence structures known as semigraphoids [?] 115} [2"3"]. 

Section 4 is devoted to convex rank tests that are induced by submodular functions. These 
submodular rank tests are in bijection with Minkowski summands of the (n— l)-dimensional 
permutohedron and with structural imset models. These tests are at a suitable level of generality 
for the biological applications [144126] that motivated us. The connection between polytopes and 
independence models is made concrete in the classification of small models in Remarks [T711191 

In Section 5 we study the subclass of graphical tests. In combinatorics, these correspond 
to graph associahedra, and in statistics to graphical models. The equivalence of these two 
structures is shown in Theorem [221 The implementation of convex rank tests requires the 
efficient enumeration of linear extensions of partially ordered sets. Our algorithms and software 
are discussed in Section 6. A key ingredient is the efficient computation of distributive lattices. 
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2 Rank tests and posets 

A permutation tt in S n is a total order on the set [n] := {1, . . . , n}. This means that tt is a set of 
(2) ordered pairs of elements in [n]. For example, tt = {(1,2), (2,3), (1,3)} represents the total 
order 1 > 2 > 3. If tt and tt' are permutations then tt n tt' is a partial order. 

In the applications we have in mind, the data are vectors u € W 1 with distinct coordinates. 
The permutation associated with u is the total order tt = { (i,j) £ [n] x [n] : ui < Uj }. We shall 
employ two other ways of writing a permutation. The first is the rank vector p = (pi, . . . ,p n ), 
whose defining properties are {pi, . . . , p n } = [n] and pi < pj if and only if m < Uj. That is, the 
coordinate of the rank vector with value i is at the same position as the ith smallest coordinate 
of u. The second is the descent vector 5 = (Sifal ■■■ \5 n ). The descent vector is defined by 
U3i > u S i+1 f° r i = 1, 2, . . . , n— 1. Thus the ith coordinate of the descent vector is the position of 
the ith largest value of the data vector u. For example, if u = (11,7, 13) then its permutation 
is represented by tt = {(2, 1), (1, 3), (2, 3)}, by p = (2,1,3), or by 5 = (3|1|2). 

A permutation tt is a linear extension of a partial order P on [n] if P C tt, i.e. tt is a total 
order that refines the partial order P. We write C(P) C S n for the set of linear extensions of P. 
A partition r of the symmetric group S n is a pre-convex rank test if the following axiom holds: 

(PC) If t(tt) = r(vr') and tt" G C(tt n tt') then t{tt)=t(tt')=t(tt"). 
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Note that it" G C(ir n it') means it n 7r ; C 7r". The number of all rank tests r on [n] is the .BeZZ 
number B n \, which is the number of set partitions of a set of cardinality n\. 

Example 1. For n = 3 there are Bq = 203 rank tests, or partitions of the symmetric group S3, 
which consists of six permutations. Of these 203 rank tests, only 40 satisfy the axiom (PC). One 
example is the pre-convex rank test in Figure 1. Here the symmetric group S3 is partitioned 
into the four classes {(1|2|3)}, {(2|1|3)}, {(2|3|1)}, and {(1|3|2), (3|1|2), (3|2|1)} . 

Each class C of a pre-convex rank test r corresponds to a poset P on the ground set [n]; 
namely, the partial order P is the intersection of all total orders in that class: P = flvrec 7r - 
The axiom (PC) ensures that C coincides with the set C{P) of all linear extensions of P. The 
inclusion C C £(P) is clear. The proof of the reverse inclusion C(P) C C is based on the fact 
that, from any permutation it in C(P), we can obtain any other it' in C(P) by a sequence of 
reversals (a, b) 1— > (6, a), where each intermediate 7r is also in C(P). Consider any ttq G £(P) 
and suppose that tti G C differs by only one reversal (a, 6) 6 7To, (6, a) G 7Ti. Then (6, a) ^ P, so 
there is some TT2 G C such that (a, 6) G 7r2; thus, 7ro G £(-7Ti n ^2) by (PC). This shows ttq G C. 

A pre-convex rank test therefore can be characterized by an unordered collection of posets 
Pi, P2, . . . , Pk on [n] that satisfies the property that the symmetric group S n is the disjoint 
union of the subsets C(Pi), C{P-i), . . . , £(P&). This structure was discovered independently and 
studied by Postnikov, Reiner and Williams [TBI §3] who used the term complete fan of posets 
for what we shall call a convex rank test in Section 3. The posets Pi, P2, ■ ■ ■ , Pk that represent 
the classes in a pre-convex rank test capture the shapes of data vectors. In graphical rank tests 
(Section [5]), this shape can be interpreted as a smoothed topographic map of the data vector. 

Example 2 (The sign test for paired data). The sign test is performed on data that are paired 
as two vectors u = (u\,U2, ■ ■ ■ ,u m ) and v = (v±,V2, ■ ■ ■ ,v m ). The null hypothesis is that the 
median of the differences U{ — Vi is 0. The test statistic is the number of differences that are 
positive. This test is a rank test, because u and v can be transformed into the overall ranks of 
the n = 2m values, and the rank vector entries can then be compared. This test coarsens the 
convex rank test which is the MSS test of Section 4 with K, = {{1, m + 1}, {2, m + 2}, . . . }. 

Example 3 (Runs tests). A runs test can be used when there is a natural ordering on the 
data points, such as in a time series. The data are transformed into a sequence of 'pluses' 
and 'minuses,' and the null hypothesis is that the number of observed runs is no more than 
that expected by chance. Common types of runs tests include the sequential runs test ('plus' if 
consecutive data points increase, 'minus' if they decrease), and the runs test to check randomness 
of residuals, i.e. deviation from a curve fit to the data. A runs test is a coarsening of a convex 
rank test, known as up-down analysis [26\ §6.1.1], which is described in Example [7] below. 

These two examples suggest that many rank tests from classical non-parametric statistics 
have a natural refinement by a pre-convex rank test. However, not all tests have this property. 
Because many classical rank tests apply to loosely grouped data (e.g. data which are divided 
into two samples), the axiom (PC) is not always satisfied. In such cases, the pre-convex rank 
test is a first step, after which permutations are grouped together under additional symmetries, 
e.g., the permutations 5 = (1|2|3|4|5) and 5' = (5|4|3|2|1) might be identified. 

The adjective "pre-convex" refers to the following interpretation of the axiom (PC). Consider 
any two data vectors u and u' in R n , and a convex combination u" = An + (1 — \)u', with 
< A < 1. If tt,tt',7t" are the permutations of u,u',u" then tt" G C(tt n ir'). Thus the 
equivalence classes in M. n specified by a pre-convex rank test are convex cones. In the next 
section, we shall remove the prefix from "pre-convex" if the faces of these cones fit together well. 
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Figure 1: Illustration of a pre-convex rank test that is not convex. Cones are labelled by descent 
vectors, so 1|2|3 indicates the cone u\ > U2 > U3. This rank test is specified by the four posets 
Pi = {3<1,2<1,3<2}, P 2 = {K2,3<2,3<1}, P 3 = {3<2,1<3,1<2} and P 4 = {2<3}. 

3 Convex rank tests 

A fan in M. n is a finite collection T of polyhedral cones [27] which satisfies the following properties: 

(i) if C G T and C is a face of C, then C G J 7 , 

(ii) if C, C G J 7 , then C n C" is a face of C. 

Two vectors it and v in R n are permutation equivalent when m < Uj if and only if Vi < Vj, 
and Ui = Uj if and only if Vi = Vj for all i,j G [n]. Note that for two data vectors, each with 
distinct coordinates, they are permutation equivalent if and only if they have the same rank 
vector. The permutation equivalence classes (of which there are 13 for n = 3) induce a fan 
called the S n -fan. The arrangement of hyperplanes {xi = Xj} that defines these classes is also 
known as the braid arrangement, and its regions as the Weyl chambers of the Lie algebra sl(n). 
The maximal cones in the S^-fan, which are the closures of the permutation equivalence classes, 
are indexed by permutations 5 in S n . A coarsening of the S n -fan is a fan J- such that each 
permutation equivalence class of R n is fully contained in a cone C of J~ '. Such a fan T defines a 
partition of S n because each maximal cone of the S n -f&n is contained in some cone C G T . 

Definition 4. A convex rank test is a partition of the symmetric group S n which is induced by 
a coarsening of the 5 n -fan. We identify the fan with that rank test. 

We say that two maximal cones, indexed by 5 and 5', of the S^-fan share a wall if there 
exists an index k such that 8}. = 5' k+1 , 5k+i = S' k , and <5j = 5[ for i ^ {k, k + 1}. This condition 
means that the corresponding permutations 5 and 5' differ by an adjacent transposition. To 
such an unordered pair {5, 5'}, we associate the following (elementary) conditional independence 
(CI) statement: 

h -lh<5jfc + i|{5i,...,5jfe_i}. (2) 

The notation was coined by Dawid [7J, where it is used to formally describe conditional inde- 
pendence among sets of random variables; we will see the connection shortly. For k = 1 we use 
the standard convention to abbreviate Si _LL 62 \ { } by Si _LL 5% . 

Example 5. For n = 3 there are 40 pre-convex rank tests (Example [1]) , but only 22 of them are 
convex rank tests. The corresponding CI models are shown in Figure 5.6 on page 108 in [23J. 

The formula ([2]) defines a map from the set of walls of the 5 n -fan onto the set 

T n := {iALj\K : K C [n]\{i,j}}. 
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of all elementary CI statements. In this manner, each wall of the S" n -fan is labeled by a CI 
statement. The map from walls to CI statements is not injective; there are (n — k — 1)1 (k — 1)! 
walls which are labeled by ([2]). 

The S n -fan is the normal fan [27J of the permutohedron P n , which is the (n — l)-dimensional 
convex hull of the vectors (pi, . . . , p n ) E M. n , where p runs over all rank vectors of permutations 
in S n . Each edge of P n joins two permutations if they differ by an adjacent transposition. In 
other words, each edge corresponds to a wall and is thus labeled by a CI statement. A collection 
of parallel edges of P n that are perpendicular to a given hyperplane {xi = Xj} corresponds to 
the set of CI statements iALj\K, where K ranges over all subsets of 

The two-dimensional faces of P n are squares and regular hexagons, and two edges of P n 
have the same label in T n if, but not only if, they are opposite edges of a square. Figure 2(c) 
depicts the subset of P5 in which the last two coordinates of u £ W 1 are less than or equal to all 
other coordinates. It consists of two copies of the hexagon in 2(a), with the final two entries of 
the descent vector either 4|5 (in the top hexagon) or 5|4 (in the bottom hexagon). All vertical 
edges are labeled by the CI statement 4_LL5|{1, 2, 3}. 
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(a) (b) (c) 

Figure 2: (a) The permutohedron P3 and (b) the 53-fan projected to the plane. The indicated 
rank test is up-down analysis. Each permutation is represented by its descent vector 5 = <5i|#2|^3- 
Missing walls of the S n -fan, or solid edges of P n , are labelled by CI statements, (c) Edges of 
the permutohedron on opposite sides of a square (here, all vertical edges) are labelled by the 
same CI statement; hexagonal prisms such as the one pictured here appear in P n for n > 5. 

Any convex rank test T is characterized by the collection of walls {5,5'} that are removed 
when passing from the S^-fan to T . So, from ([2]), any convex rank test T maps to a set Mjr 
of CI statements corresponding to missing walls, or a set Mjf of edges of the permutohedron. 
For example, if T is the fan obtained by removing the two dashed rays in Figure 2 (b) then the 
corresponding set of CI statements is A4jr = {l_LL3|0, 1_LL3|{2}|. 

Conditional independence statements [7J describe the dependence relationship among ran- 
dom variables. A semigraphoid is a set A4 of general conditional independence statements 
satisfying certain properties [15]. These general conditional independence statements, in con- 
trast to the elementary CI statements already introduced, can take subsets of [n] in their first 
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two arguments. The conditions are, for X,Y, Z pairwise disjoint subsets of [n], 

(SGI) X ALY\Z £ M Y 1L X \ Z e M 

(SG2) X ALY\Z £ M andU C X => U ALY\Z G M 

(SG3) X ALY\Z € M andU C X => X ALY\(UUZ) G M 

(SG4) X _U_ F | Z G .M and X _U_ W | (Y" U Z) =► X _U_ (W U Y") | Z G At 

It was shown by Studeny [21] that these are not a complete set of axioms for probabilistic 
conditional independence, although they are true of any probabilistic model. A semigraphoid is 
determined by its trace among statements of the form i _LL j \ K where i and j are singletons. 
Namely, I _I_L J\K holds if and only if i _LL j\L for all i G I,j G J and L such that K C L C 
(/ U J U iT) \ £j; see [12]. Casting the semigraphoid axiom in terms of the trace, we say that a 
subset A4 of T n is a semigraphoid if i _LL j j K G A4 implies j _LL i \ K G M and the following 
axiom holds: 

(SG) i ALj\KUl e M and i_U_£|KG7W 

implies iALj\KeM and i _LL I \ KUj G A4. 

This axiom is stated in |131l23j . Our first result is that semigraphoids and convex rank tests are 
the same combinatorial object: 

Theorem 6. The map T i— > TWjf is a bijection between convex rank tests and semigraphoids. 

Before presenting the proof of this theorem, we shall discuss an example. 

Example 7 (Up-down analysis). Let T denote the convex rank test called up-down analysis 
[26] . In this test, each permutation tt G S n is mapped to the sign vector of its first differences, 
or, equivalently, its descent set. Thus this test is the natural map r : S n — > {— ,+} n ~ 1 . The 
corresponding semigraphoid Aip consists of all CI statements i _LL j \ K where \i — j\ > 2. 

This convex rank test is visualized in Figure 2(a,b) for n = 3. Permutations are in the 
same class (have the same sign pattern) if they are connected by a solid edge; there are four 
classes. In the S^-fan, the two missing walls are labeled by conditional independence statements 
as defined in ([2|). For n = 4 the up-down analysis test T is depicted in Figure 3. The double 
edges correspond to the twelve CI statements in M^. There are eight classes; e.g., the class 
{3|4|1|2, 3|1|4|2, 1 1 3 1 4 1 2 , 1|3[2|4, 3|1|2|4} consists of the five permutations in S4 which have the 
up-down pattern (—,+,—). 

Our proof of Theorem [6] rests on translating the semigraphoid axiom (SG) into geometric 
statements about edges of the permutohedron. Recall that a semigraphoid Ad can be identified 
with the set M of edges of the permutohedron whose CI statement labels are those of A4. 

Observation 8. A set M of edges of the permutohedron P n is a semigraphoid if and only if 
the set M satisfies the following two geometric axioms: 

Square axiom: Whenever an edge of a square is in M, then the opposite edge is also in M. 

I ! - I I 

Hexagon axiom: Whenever two adjacent edges of a hexagon are in M, then the two opposite 
edges of that hexagon are also in M. 
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Figure 3: The permutohedron P4 with vertices marked by descent vectors 8 (bars | omitted). 
The convex rank test indicated by the double edges is up-down analysis. 

Let M be the subgraph of the edge graph of P n defined by the statements in M; that is, M 
consists of edges whose labels are in A4. Each class of the rank test defined by M. consists of the 
permutations in some connected component of M. We regard a path from 8 to 8' on P n as a word 
(ji 1 ) . . . a ( l ) i n the free associative algebra A generated by the adjacent transpositions of [n]. For 
example, the transposition 023 := (23) gives the path from <5 to 8' = 0-238 = 8i\8s\82\84 \ . . . \8 n . 
The following relations in A define a presentation of the group algebra of S n as a quotient of A: 

(BS) CTi,i+l ■ <Ji+k+l,i+k+2 — &i+k+l,i+k+2 " 

(BH) CTj ; j + i • ■ cr^j+i — • Oi : i + iOi + i : i + 2, and 

(BN) al +1 - 1, 

where suitable i and k vary over [n] . The first two are the braid relations, and the third represents 
the idempotency of each transposition. 

Now, we regard these relations as properties of a set of edges of P n , by identifying a word 
and a permutation 8 with the set of edges that comprise the corresponding path in P n . For 
example, a set satisfying (BS) is one such that, starting from any 8, the edges of the path 
Oi,i+iOi + k+i,i+k+2 ar e in the set if and only if the edges of the path Oi + k+i : i+k+2°~i,i+i are in the 
set. Note then, that (BS) is the square axiom, and (BH) is a weaker version of the hexagon axiom 
of semigraphoids. That is, implications in either direction hold in a semigraphoid. However, 
(BN) holds only directionally in a semigraphoid: if an edge lies in the semigraphoid, then its 
two vertices are in the same class; but the empty path at some vertex 8 certainly does not imply 
the presence of all incident edges in the semigraphoid. Thus, for a semigraphoid, (BS) and (BH) 
hold, but (BN) must be replaced with the directional version 

(BN') 4 +1 ^ 1. 

We now consider a path p from 8 to 8' in a semigraphoid. Here is a crucial lemma for our proof: 

Lemma 9. Suppose that M. is a semigraphoid. If 8 and 8' lie in the same class of M., then so 
do all shortest paths on P n between them. 
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The lemma in turn depends on the following version of a classical result due to Jacques 
Tits. This result, which can be found in [U p. 49-51]), essentially states that the relations 
(BS),(BH),(BN) form a Grobner basis for the two-sided ideals they generate in A. 

Theorem 10 (Tits |24|). Let p and q be words representing paths on P n . 

(1) A word p is (BS),(BH),(BN)-reduced if and only if it is (BS), (BE), (BN') -reduced. 

(2) If p and q are reduced, then they represent the same element of the symmetric group S n if 
and only if p can be transformed to q by the the application of (BS) and (BH) only. 

Proof of Lemma\Q Theorem [10] (1) says that if there is any path connecting 5 and 5' , then there 
is a shortest path connecting them. Thus if 5 and 5' lie in the same class of M, some shortest 
path 5 —> 5' also lies in that class. Now (2) says that if p and q are both shortest paths, then q 
can be obtained from p by application of only the square and hexagon axioms, (BS) and (BH). 
Thus if any shortest path 5 — > 5' lies in the class of A4 containing them both, so do all other 
shortest paths connecting them. □ 

We need one lemma to deal with intersections of nonmaximal cones. Denote by -< the 
transitive relation "is a face of" and write F W (C) for the face of a cone C at which w is minimized. 

Lemma 11. If the intersection of two cones C\ and C 2 is a face of both, then the intersection 
of any faces D -< C\ and E -< C 2 is a face of both. 

Proof. By transitivity of -< and the hypothesis it suffices to show DnE -< C\f\d- Since D <C\, 
there exists a linear functional w such that the face F W (C\) equals D and C\ n C 2 C C\ C H£. 
Then F w (d n C 2 ) = D n C 2 so D n C 2 -< d n C 2 . Similarly, E n d -< d n C 2 . Then since the 
intersection of any two faces of C\ D C 2 is also a face, D n E -< C\ n d as desired. □ 

Proof of Theorem 0. Both semigraphoids and convex rank tests can be regarded as sets of edges 
of P n . We first show that a semigraphoid satisfies (PC). Consider 5, 5' in the same class C of a 
semigraphoid, and let 5" G C(5n5'). Further, let p be a shortest path from 5 to 5" (so, pS = 5"), 
and let q be a shortest path from 5" to 5' . We claim that qp is a shortest path from 5 to 5' , and 
thus 5" G C by Lemma [9j Suppose is not a shortest path. Then, we can obtain a shorter 
path in the semigraphoid by some sequence of substitutions according to (BS), (BH), and (BN'). 
Only (BN') decreases the length of a path, so the sequence must involve (BN'). Therefore, there 
is some i, j in [n], such that their positions relative to each other are reversed twice in qp. But 
p and q are shortest paths, hence one reversal occurs in each of p and q. Then 5 and 5' agree 
on whether i > j or j > i, but the reverse holds in 8", contradicting 5" G C(S D 6'). Thus every 
semigraphoid is a pre-convex rank test. 

Now, we show that a semigraphoid corresponds to a fan. We first argue that we may 
reduce to the case of two maximal cones, each coming from a class in the semigraphoid, whose 
intersection is codimension one in both. By Lemma [TT1 we can consider maximal cones only. 
Suppose two maximal cones d, Cfc have intersection C\ Pi Ck which is not codimension one. 
Then there exists a sequence of maximal cones C%, d, ■ ■ ■ , Ck such that d^d+i is codimension 
one, d n C k C Ci n Cj+i for all i = 1, ... k - 1, and in fact d n C k = d n C 2 n • • • n C k . We 
have that (Cj n Cj+i) H (C,;+i n Ci +2 ) is a face of Cj+i and Cj + 2 by Lemma [TT], and also is a face 
of Cj. Thus Cj fl Cj + i n Cj+2 -< d, Ci + \ , Ci +2 ; continuing in this manner, we eventually get that 
C\ D C2 n • • • n Ck -< d, C k as required. 

Consider the cone corresponding to a class C. We need only show that its codimension one 
intersection with another maximal cone is a shared face. Since C is a cone of a coarsening of 
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the S^-fan, each facet of C lies in a hyperplane H = {x{ = xj}. Suppose a face of C coincides 
with the hyperplane H and that i > j in C. A vertex 5 borders H if i and j are adjacent 
in 5. We will show that if 5,8' € C border H, then their reflections 8 = 8\ \ . . . \ j\i\ . . . \8 n and 
8' = 5[ \ . . . \ j\i\ . . . \5' n both lie in some class C. Consider a 'great circle' path between 5 and 5' 
which stays closest to H: all vertices in the path have i and j separated by at most one position, 
and no two consecutive vertices have i and j nonadjacent. This is a shortest path, so it lies in C, 
by Lemma [9) Using the square and hexagon axioms (Observation [8J, we see that the reflection 
of the path across H is a path in the semigraphoid that connects 5 to 5' (Figure 3). This shows 
that the intersection of C and C is a face of both. Thus a semigraphoid is a convex rank test. 

Finally, if M is a set of edges of P n representing a convex rank test, then it is easy to show 
that M satisfies the square and hexagon axioms. □ 

5' 



5> 

Figure 4: Reflecting a path across a hyperplane. 

4 The submodular cone 

In this section we focus on a subclass of the convex rank tests. Let 2^ denote the collection 
of all subsets of [n] = {1,2,... ,n}. Any real-valued function w : 2^ — > R defines a convex 
polytope Q w of dimension < n — 1 as follows: 

Qw ■= {x £R n : xi + x 2 -\ \-x n = w([n]) 

and Eie/ x i ^ u '( / ) for all ^ I C [n] }. 

A function w : 2^ — > R is called submodular if w(I)+w(J) > w(lDJ) + w(IUJ) for /, J C [n]. 
The submodular cone is the cone C n of all submodular functions w : 2^ — ► R. Working modulo 
its lineality space C n n (— C n ), we regard C n as a pointed cone of dimension 2 n — n — 1. 

Studying functions w means that in considering the normal fan of a polytope Q w , we want 
to retain information about non-binding inequalities that are just barely so, i.e. that hold with 
equality. For this reason we define the vector (normal) fan pQ. The indicator function of each 
I € 2W defines a vector ej in the 1-skeleton of the 5" n -fan, understood modulo et n y, for example, 
these vectors for n = 3 are eooi, eoio> e i00i eon, . . . , em. A vector fan g" is a collection of subsets 
of {ej : I G 2^} such that U,V E $ implies U D V G g. A vector fan defines a usual fan by 
taking the maximal cones of the fan to be the cones generated by the vector sets in the vector 
fan. We say that a vector fan is complete if its fan is. A vector fan $ coarsens another vector 
fan if for all U G <S, there exists Fe5 with U C V. 

Given a function w : 2^ — > R, each / G 2^ defines an inequality Y^iel Xi — Wl a PP ear i n g 
in the definition of Q w ; the vector normal fan tells us which of these inequalities holds with 
equality on some face of Q w . We define the vector normal fan of a function w : 2^ — > R as the 
set {{ej : I G 2^ n \ xi = wj for all x G F} for each face F G Q w }. The vector normal fan of 
w defines a fan which is the normal fan of Q w and retains additional information. 
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Proposition 12. A function w : 2^ — > IR is submodular if and only if the vector normal fan 
of w is a coarsening of the vector S n -fan. 

Example 13. Let wi = W2 = = l,wi% = W13 = W23 = ^123 = 3. The polytope Q w 
is the point (1, 1, 1) but the function w is not submodular. The vector normal fan $ of w is 
{{ e ooii e oiOi e ioo}} an d the normal fan is all of M 3 /(l, 1, 1). $ does not coarsen the 5 n -fan since, 
for example, eno is not contained in any set in 

However, if we change w slightly to define the same Q w but with the inequalities correspond- 
ing to Oil, 101, and 110 also holding with equality, e.g. w± = 1V2 = W3 = 1, wu = W13 = W23 = 2, 
and W123 = 3, the resulting vector normal fan of w is a coarsening of the (vector) S* n -fan. 

Proof. We show only the if direction of Proposition [T2J Suppose uu is not submodular. Then 
there exist I, J C 2l n l such that 



wi + wj < wmj + wiuj 



We also have that 



^ ^ Xj + ^ ^ Xi — ^ ^ Xi + ^ ^ Xi 

giuj ieinj iei «gJ 

< W! + wj < w In j + w IUJ 



So ZliG/uJ x * < w iuJ+(winJ-YlielnJ x i) and similarly J]i e /n J x * < w ir\J+(wiuJ-YlieluJ x i)> 
so that at most one of the inequalities corresponding to / U J and I Pi J can hold with equality 

at any point of Q w . Then any set in the vector normal fan of w either fails to contain e/nj or 

fails to contain e/uj. □ 

Proposition [12] can be paraphrased as follows: the function w is submodular if and only if 
the optimal solution of 

maximize u ■ x subject to x € Q w 

depends only on the permutation equivalence class of u. Thus, solving this linear programming 
problem constitutes a convex rank test. Any such test is called a submodular rank test. 

A convex polytope is a (Minkowski) summand of another polytope if the normal fan of the 
latter refines the normal fan of the former. The polytope Q w that represents a submodular rank 
test is a summand of the permutohedron P n . 

Theorem 14. The following combinatorial objects are equivalent for any positive integer n: 

1. submodular rank tests, 

2. summands of the permutohedron P n , 

3. structural conditional independence models I23\j , 

4. faces of the submodular cone C n in R 2 ™ . 

Proof. We have 1 <^=^ 2 from Proposition 1121 and 1 <^=^ 3 follows from [23]. Further, 1 <^=^> 4 
is a direct consequence of our definition of submodular rank tests. □ 

Remark 15. All 22 convex rank tests for n = 3 are submodular. The submodular cone C3 is 
a 4-dimensional cone whose base is a bipyramid. Its f-vector is (1, 5, 9, 6, 1). The polytopes Q w , 
as w ranges over representatives of the faces of C3, are all the Minkowski summands of P3. 

Proposition 16. For n > 4, there exist convex rank tests that are not submodular rank tests. 
Equivalently, there are fans that coarsen the S n -fan but are not the normal fan of any polytope. 
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Proof. This result is well-known. It is stated in Section 2.2.4 of [23] in the following form: 
"There exist semigraphoids that are not structural." □ 



An interesting example which also proves Proposition [16] is the following semigraphoid: 

M = {2 _U_ 3|{1,4}, 1 _LL 4|{2,3}, 1 _U_ 2|0, 3 _LL 4|0}. 

The corresponding fan consists of unimodular cones, or, equivalently, the posets Pi representing 
this non-submodular convex rank test are all trees. This example answers a question posed in 
the first version of [18] . A systematic method for showing that a semigraphoid is not submodular 
can be found in [9]. Results in that paper include an example of a coarsest semigraphoid which 
is not submodular and a proof that the semigraphoid semigroup is not normal. 

Remark 17. For n = 4 there are 22108 submodular rank tests, one for each face of the 11- 
dimensional cone C4. The base of this submodular cone is a 10-dimensional polytope with /- 
vector (1, 37, 356, 1596, 3985, 5980, 5560, 3212, 1128, 228, 24, 1). The 37 vertices of this polytope 
correspond to the maximal semigraphoids. These come in seven symmetry classes up to the * 
involution ([3]) and the S^-action. The types of maximal semigraphoids for n = 4 are displayed 
in the following table: 



Symmetry 


No. 


iALj 


iALj\k 


iALj\{k,l} 


lx and * 


2 


all 


all 


none 


4x and * 


8 


all 


all but 2_LL3|1,1_LL3|2, 1_LL2|3 


34L4|12,2JL4|13,14L4|23 


6x incl. * 


G 


all but 1_LL2 


all but 14L2|3,1JL2|4 


all but 1U_2|34 


4x and * 


8 


all 


2_U_3|4,2_U_4|3,3_U_4|2 


31L4|12,2U_4|13,21L3|14 


lx, self-* 


1 


all 


none 


all 


6x incl. * 


G 


all but 1AL2 


2JL3|1,2JL4|1,1JL3|2,1JL4|2 


all but 3JL4|12 


6x incl. * 


6 


3_L4 


all but 2_U_3|4,2_U_4|3, 1_U_4|3, 1_U_3|4 


1±L2|34 



Remark 18. For n = 5 there are 117978 coarsest submodular rank tests, in 1319 S5 symmetry 
classes. We confirmed this result of [22J with POLYMAKE [S]. 

We now define a class of submodular rank tests, which we call Minkowski sum of simplices 
(MSS) tests. Note that each subset K of [n] defines a submodular function wk by setting 
wk{I) = 1 if K fl / is non-empty and wk{I) = if K H / is empty. The corresponding polytope 
Q WK is the simplex Ak = convjefc : k € K}. 

Now consider an arbitrary subset K, = {K±, K2, ■ ■ ■ , K r } of 2^. It defines the submodular 
function w/c = wk 1 + wk 2 + • • • + WK r - The corresponding polytope is the Minkowski sum 

A K = A Kl +A K2 + --- + A Kr . 

The associated MSS test t;c is defined as follows. Given p £ S n , we compute the number of 
indices j G [r] such that max{pk : k G Kj} = pi, for each % 6 [n\. The signature Tic(p) is the 
vector in N n whose ith coordinate is that number. Few submodular rank tests are MSS tests: 

Remark 19. For n = 3, there are 22 submodular rank tests, but only 15 of them are MSS tests. 
For n = 4, there are 22108 submodular rank tests, but only 1218 of them are MSS tests. 

In light of Theorem [6] it is natural to ask which semigraphoids correspond to an MSS test. 
Geometrically, we wish to know which edges of the permutohedron P n are contracted when 
passing to the polytope Q WK . To be precise, let Adjc denote the semigraphoid derived from T WKi 
using the bijection in Theorem [6j We then have the following result: 
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Proposition 20. The semigraphoid Mkl ^ s the set of CI statements of the form i _LL j \ K 
where all sets containing {i,j} and contained in {i,j} U are not in JC. 

Proof. Consider two permutations 5 and 5' which are adjacent on the permutohedron P„, and 
let i _LL j | K be the label of the edge that connects 5 and 5' . That CI statement is in A4fc if 
and only if 5 and 5' are mapped to the same vertex in A^ if and only if 5 and 5' are mapped to 
the same vertex in each simplex A^- ; for / = 1,2, . . . , r. For each I, this means that the leftmost 
entry of the descent vector 5 that lies in K\ agrees with the leftmost entry of the other descent 
vector 5' that lies in K\. This condition is equivalent to 

If, D (tf U + {i,j} for / = l,2,...,r. 

Thus i Jl_ j | K is in the semigraphoid Mk, associated with the set family K, if and only if fC 
contains no set whose intersection with K U {i, j} equals {i, j}. This is precisely our claim. □ 

There is a natural involution * on the set of all CI statements which is defined as follows: 

(iALj\C)* := ilLj\[n}\(CU{i,j}). (3) 

If A4 is any semigraphoid, then the semigraphoid Ai* is obtained by applying the involution * 
to all the CI statements in the model A4 . This involution is referred to as duality in [11] . In the 
boolean lattice, whose elements are the subsets of [n], the involution corresponds to switching 
the role of set intersection and set union. 

The MSS test tk, was defined above in terms of weight functions w. What follows is a similar 
construction for the duals of MSS tests. Let z/c(J) = 1 for J € K, and z/c(J) = otherwise. 
Then the function w* : 2^ — > M. defined by w^(I) := YLjri Z K.(J) is supermodular. We set 

Q* w := { x G M n : x% + x 2 + ■ • • + x n = w([n]) 
and Eie/ x i ^ W ( T ) for all / / C [n] }. 

Then the equality Q^ K = A# holds for = A^ + Ak 2 + • • • + Ax r • This equality is 
precisely the statement in Proposition 6.3 of Postnikov's paper |17j . 

5 Graphical tests 

We have seen that semigraphoids are equivalent to convex rank tests. We now explore the 
connection to graphical models. Let G be a graph with vertex set [n] and /C(G) the collection of 
all subsets K C [n] such that the induced subgraph of G\k is connected. The undirected graphical 
model (or Markov random field) derived from the graph G is the set M of CI statements: 

A4 G = { i _LL j | C : the restriction of G to [n]\C contains no path from i to (4) 

Theorem 21. The set A4 G of CI statements in the graphical model G is egual to the semi- 
graphoid M)c(G) associated with the family IC(G) of connected induced subgraphs of G. 

Proof. The defining condition in is equivalent to saying that the restriction of G to any node 
set containing {i,j} and contained in {i,j} U ([n]\C) is disconnected. With this observation, 
Theorem 1211 follows directly from Proposition 1201 □ 
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The polytope = Ajc(g) associated with the graph G is the graph ass octahedron. This is a 
well-studied object in combinatorics [13 [5]. Carr and Devadoss [5] showed that Aq is a simple 
polytope whose faces are in bijection with the tubings of the graph G. Tubings are defined as 
follows. Two subsets A, B C [n] are compatible for G if one of the following conditions holds: 
A C B, B C A, or A n B = 0, and there is no edge between any node in A and B. A tubing of 
the graph G is a subset T of 2^ such that any two elements of T are compatible. The set of 
all tubings on G is a simplicial complex; it is dual to the face lattice of the simple polytope Aq. 

For any graph G on [n] we now have two convex rank tests. First, there is the graphical model 
rank test tw^q , which is the MSS test of the set family JC(G). Second, we have the graphical 
tubing rank test Twgj, which is the convex rank test associated with the semigraphoid (M. )* 
dual to A4 G . Explicitly, that dual semigraphoid is given by 

(Ai G )* = { i -LL j | C : the restriction of G to C U {i,j} contains no path from i to (5) 




3214 2314 



Figure 5: The permutohedron P4. Double edges indicate the MSS test t^(g) where G is the 
4-chain. Edges with large dots indicate the dual tubing test . 

We summarize our discussion in the following theorem: 

Theorem 22. The following four combinatorial objects are isomorphic for any graph G on [re]: 

• the graphical model rank test t^iq\ , 

• the graphical tubing rank test Tw™, 

• the fan of the graph associahedron Aq, 

• the simplicial complex of all tubings on G. 

We note that when the graph G is a path of length re, Aq is the associahedron, and when it 
is an re-cycle, Aq is the cyclohedron. The number of classes in either the MSS test t^(g) or the 
tubing test is the G-Catalan number of [17J. This number is the classical Catalan number 

^j-j-( 2 ™) for the associahedron test. It equals ( 2 ™Ii) f or the cyclohedron test [14] . 
Example 23. Let n = 4 and let G be the 4-chain 1 — 2 — 3 — 4. Then 

M G = {l _LL 3 I 24, 1 _LL 4 I 23, 2 _U_ 4 | 13, 1 _LL 3 | 2, 1 _LL 4 | 2, 1 U_ 4 | 3, 2_U_4|3}, 
(M G )* = {1 4L 3, 1_U_4, 2_U_4, 1 _U_ 3 | 4, 1 _U_ 4 [ 3, liL4|2, 2 _LL 4 | 1} . 
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The corresponding tests t^g) an d Tjc(Cf) are depicted in Figure [SJ Note that contracting either 
class of marked edges on the permutohedron in Figure [5] leads to the 3-dimensional associahedron 
Aq. The associahedron is the Minkowski sum of the simplices where K runs over 

K(G) = {{1}, {2}, {3}, {4}, {1, 2}, {2, 3}, {3, 4}, {1, 2, 3}, {2, 3, 4}, {1, 2, 3, 4}} . 

The 3-dimensional simple polytope A4 has 14 vertices, one for each of the 14 tubings of G. □ 

In our application of graphical rank tests, we found it more natural to work with the tubing 
test T^ G j instead of the MSS test r^ G y We refer to our companion paper [T3] which gives 
a detailed discussion of the cyclohedron test and its applications. By the cyclohedron test we 
mean the tubing test Tw g n where the graph G is a cycle of length n. 



O O 



Figure 6: Tubing of the 6-chain. Encircled regions indicate the sets Uj. 

Applying the tubing test to a data vector u G M n can be viewed as an iterative procedure 
for drawing a topographic map on the graph G. Namely, we encircle the vertices of G by sets 
Ui, . . . , U n in the order 61,62, ■■■ , 6 n -\, with the following provision: if <5j is next to be encircled 
and shares an edge with some vertex j which has already been encircled by some Uj, then Ui must 
also contain the circle Uj. The result is a collection U of n — 1 encircled sets U±, U%, . . . , J7 n -i, 
and this unordered collection of sets is the signature of v. The height hi of the i-th. node in the 
topographic map for v is the number of sets Uj which contain i. We can identify the signature 
U with the height vector h = (hi, /12, • • • , h n ), since U can be recovered uniquely from the vector 
h. The map u 1— > h(u) can be interpreted as a smoothing of the data. Figure 6 displays the 
topographic map when the data vector is u = (2.1, 0.3, 1.8, , 2.0, 1.1, 0.1). Here G is the 6-chain 
1 — 2 — 3 — 4 — 5 — 6. and the descent vector of u equals 6 = (1|5|3|2|4|6). 

6 On counting linear extensions 

In this paper, we have introduced a hierarchy of rank tests, which range from pre-convex to 
graphical. Convex rank tests are applied to data vectors u G W 1 , or permutations ir G S n , and 
determine their cones in a fan T which coarsens the S n -fan. The significance of a data vector 
in such a test is measured by a certain p- value, whose precise derivation is described in |14j . 
Computation of that p- value rests on our ability to compute the quantity |r~ 1 (r(7r)) |, which 
is the number of permutations in the maximal cone of T corresponding to tt. Recall that the 
cones of a convex rank test are indexed by posets Pi, P2, ■ ■ ■ , Pk on [n], and our computations 
amount to finding the cardinality of the set C(Pj) of linear extensions of Pi. 

The problem of computing linear extensions of general posets is #P-complete [2], so our 
task is an intractable problem when n grows large. However, for special classes of posets, and 
for moderate values of n, the situation is not so bad. For example, in the up-down analysis of 
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Willbrand et al. (see Example [7]) , we need to count all permutations with a fixed descent set, a 
task for which an explicit determinantal formula appears in Stanley \19\ page 69]. We refer to 
[3] for a detailed study of the combinatorics of these up- down numbers. 

Likewise, there is an efficient (and easy-to-implement) method for the computing quantities 
| r _1 (r(7r)) I for any graphical graphical tubing test T^ G y as defined in Section 5. Indeed, here 
the fan T is unimodular, and hence the posets Pi are all trees. The special trees arising from 
a graph G in this manner are known as G '-trees \\.7\ [5]. The G-tree of a permutation tt is a 
representation of the poset Pi as a tree T = t^ g n (tt) with the minimum value as the root and 
maximal values as the leaves. Suppose the root of the tree T has k children, each of which is a 
root of a subtree T l for i = 1, . . . ,k. Writing \T l \ for the number of nodes in T l , we have 

This recursive formula translates into an efficient iterative algorithm. Our implementation of 
this algorithm, when G is the n-cycle, is the workhorse behind our computations in [T5]. For a 
graph G, let nbhd(i) be the set of vertices j such that there is an edge in G. 

Algorithm 24. (Permutation Counting) 

Input: A data point u as a descent permutation 5 and a graph G. 

Output: The number of permutations with the same signature as S, | t~ 1 t(tt(u)) |. 

Initialize: 

An indexed set of largest enclosing sets LE\ = • • • = LE n = 0, and counter c = 1 
for Si in 5: 

Initialize I an empty list of enclosed tree lengths 

LE 5i = {5i} 

for j in nbhd(^j): 

if LEj and j £ LE 5 .: 
LE 5l = LE Sl U LEj 
append \LEj\ to t 
c = c.ff) 
for j in LE Si : 
LEj = LE$. 
Return the permutation count c 

In the remainder of this section we discuss our method for performing these computations 
for an arbitrary convex rank test. The test is specified (implicitly or explicitly) by a collection of 
posets Pi, . . . ,Pfi on [n]. From the given permutation, we identify the unique poset Pi of which 
that permutation is a linear extension, and we construct the distributive lattice L{Pj) whose 
elements are the order ideals of Pj . Recall that an order ideal of Pi is a subset O of [re] such that 
if I € O and (k, I) G Pi then k £ O. The set of all order ideals is a distributive lattice with meet 
and join operations given by set intersection O HO' and set union OUO'. 

The distributive lattice L(Pi) is a sublattice of the Boolean lattice 2l n l, whose nodes are the 
2™ subsets of [n] = {1,2,..., n}, and we represent L(Pi) by its nodes and edges (cover relations) 
in 2^ a \ We write each edge in 2^ as a pair (K,l) where K C [re] and / € [n]\X. The edge in 
the Boolean lattice 2^ represented by the pair (K, I) is the cover relation K C K U {I}. 

Permutations in S n are in natural bijection with maximal chains in the Boolean lattice 
2N. For example, the descent permutation 5 = (4|2|1|3) corresponds to the maximal chain 
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(0, {4}, {2, 4}, {1, 2, 4}, {1, 2, 3, 4}) in the Boolean lattice 2^1 If the poset P is the linear order 
6 then L(Pi) is the subgraph of 2^1 consisting of the five nodes in the chain and the four edges 
(0,4), ({4}, 2), ({2,4}, 1) and ({1,2, 4}, 3) which connect them. The maximal chains in 2 M 
that lie in the sublattice L(Pi) are precisely the permutations that are linear extensions of P{. 
Therefore our task is to construct L{Pi) and then count its maximal chains. 

Remark 25. The linear extensions of the poset Pi are in bijection with the maximal chains in 
the distributive lattice L(Pi). See [19\ Section 3.5] for further information on this bijection. 

In general, L{Pi) is the graph whose nodes are those subsets of [n] which are order ideals in 
Pi, and the edges are (K, I) where both K and K U {1} are order ideals in Pj. Our strategy in 
computing the graph which represents L(Pi) is as follows. We start with a given permutation 6 
which lies in the class indexed by Pj. That permutation determines a maximal chain in 2^ which 
must lie in L(Pj). We then compute a certain closure of that subgraph in 2^ with respect to 
the semigraphoid A4 under consideration. This is precisely what is done in Algorithm 21 below. 
Knowledge of the distributive lattice L{Pi) solves our problem since the number of maximal 
chains of L{Pi) can be read easily from the representation of L{Pi) in terms of nodes and edges. 

Algorithm 26. (Building the Distributive Lattice) 

Input: A data point as a descent permutation 6 and a semigraphoid A4. 

Output: A distributive lattice L{Pi) representing the class of 5 in the convex rank test Ai. 

Initialize: 

A set of confirmed lattice nodes, EI = {0, {<5i}, {61,62}, • • • , {61, . . . , 6 n }} 
A set of checked lattice edges, E = {({5i, • • • , <5 n _i}, <5 n )}, 

where each pair has the form (history, next position). 
A stack of edges waiting to be checked: 

W = [(0),6 1 ),({6 1 },6 2 ),({6 1 ,6 2 },6 3 ),...,({6 1 ,...,6 n - 2 },6 n ^)] 

While W + 0: 

Pop (H, i) from the stack W 
Add (H, i) to E 

for j such that (H U {i},j) G E: 
if ilLj\H G M: 
Add (H,j) to E 
if HU{j} £ H: 
Add H U {j} to H 
Push (HU{j},i) onto W 
Return the distributive lattice L(Pi) = (M, E) 

Our program for performing rank tests implements Algorithm [26l It accepts a permutation 
6 and a rank test r, which may be specified either 

• by a list of posets P\ , . . . , P& (pre-convex) , 

• or by a semigraphoid A4 (convex rank test), 

• or by a submodular function w : 2^ — > R, 

• or by a collection K, of subsets of [n] (MSS), 

• or by a graph G on [n] (graphical test). 
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The output of our program has two parts. First, it gives the number |£(Pj)| of linear extensions, 
where the poset Pi represents the equivalence class of S n specified by the data tt. It also gives 
a representation of the distributive lattice L{Pi), in a format that can be read by the maple 
package posets |20j . Our software for Algorithms 1241 and 1261 and, more generally, for applying 
convex rank tests r to data vectors u S W 1 is available at bio.math.berkeley.edu/ranktests/. 

In closing let us give a concrete illustration of our current ability to count linear extensions. 
We computed the number of linear extensions of the Boolean poset P = 2^ consisting of all 
subsets of {1, 2, 3, 4, 5}. Our program ran in less than one second on a laptop and found that 

|L(2^)| = 14,807,804,035,657,359,360. 

This computation was inspired by work in population genetics by Daniel Weinreich [25] who 
reports the analogous calculation for P = 2^. 

Conclusions 

This work describes the connections among algebraic combinatorics, non-parametric statistics 
and graphical models (statistical learning theory). Specifically, we have proved the equivalence 
between semigraphoids and convex rank tests. This result provides the background for the 
counterexamples given in [9] and the rank tests which were applied to biological data in [14]. 
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